A Stylometric Inquiry into Hyperpartisan and Fake News

نویسندگان

  • Martin Potthast
  • Johannes Kiesel
  • Kevin Reinartz
  • Janek Bevendorff
  • Benno Stein
چکیده

This paper reports on a writing style analysis of hyperpartisan (i.e., extremely onesided) news in connection to fake news. It presents a large corpus of 1,627 articles that were manually fact-checked by professional journalists from BuzzFeed. The articles originated from 9 well-known political publishers, 3 each from the mainstream, the hyperpartisan left-wing, and the hyperpartisan right-wing. In sum, the corpus contains 299 fake news, 97% of which originated from hyperpartisan publishers. We propose and demonstrate a new way of assessing style similarity between text categories via Unmasking—a meta-learning approach originally devised for authorship verification—, revealing that the style of left-wing and right-wing news have a lot more in common than any of the two have with the mainstream. Furthermore, we show that hyperpartisan news can be discriminated well by its style from the mainstream (F1 = 0.78), as can be satire from both (F1 = 0.81). Unsurprisingly, stylebased fake news detection does not live up to scratch (F1 = 0.46). Nevertheless, the former results are important to implement pre-screening for fake news detectors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stylometric features for emotion level classification in news related blogs

Breaking news and events are often posted in the blogosphere before they are published by any media agency. Therefore, the blogosphere is a valuable resource for news-related blog analysis. However, it is crucial to first sort out newsunrelated content like personal diaries or advertising blogs. Besides, there are different levels of emotionality or involvement which bias the news information t...

متن کامل

Fake news propagate differently from real news even at early stages of spreading

Social media can be a double-edged sword for modern communications, either a convenient channel exchanging ideas or an unexpected conduit circulating fake news through a large population. Existing studies of fake news focus on efforts on theoretical modelling of propagation or identification methods based on black-box machine learning, neglecting the possibility of identifying fake news using o...

متن کامل

Automatic Detection of Fake News

The proliferation of misleading information in everyday access media outlets such as social media feeds, news blogs, and online newspapers have made it challenging to identify trustworthy news sources, thus increasing the need for computational tools able to provide insights into the reliability of online content. In this paper, we focus on the automatic identification of fake content in online...

متن کامل

Fake News Detection Through Multi-Perspective Speaker Profiles

Automatic fake news detection is an important, yet very challenging topic. Traditional methods using lexical features have only very limited success. This paper proposes a novel method to incorporate speaker profiles into an attention based LSTM model for fake news detection. Speaker profiles contribute to the model in two ways. One is to include them in the attention model. The other includes ...

متن کامل

Influence of fake news in Twitter during the 2016 US presidential election

We investigate the influence of fake and traditional, fact-based, news outlets on Twitter during the 2016 US presidential election. Using a comprehensive dataset of 171 million tweets covering the five months preceding election day, we identify 30 million tweets, sent by 2.2 million users, which are classified as spreading fake and extremely biased news, based on a list of news outlets curated ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1702.05638  شماره 

صفحات  -

تاریخ انتشار 2017